Microsoft Word - A New Language Model For Automatic Arabic Speech Recognit¡¦
نویسنده
چکیده
A new language model for Arabic language for large vocabulary automatic speech recognition (ASR) is introduced. The derivative future of the Arabic word is quite useful in dividing the process into two phases. In phase-1 the fixed words, the prefix, the suffix and the form of the derivative words are determined through phase-1M-gram, of course, given the acoustical data. In phase 2 another M-gram is used to determine the roots of the derivative words. The idea was tested on 60 words (10 roots x 6 forms). Results are encouraging the idea, and more work is to follow to realize a complete large vocabulary ASR for Arabic language.
منابع مشابه
Automatic Construction of Persian ICT WordNet using Princeton WordNet
WordNet is a large lexical database of English language, in which, nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is essentially used for word sense disambiguation, information retrieval, and text translation. In this paper, we propose s...
متن کاملOff-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model
In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...
متن کاملCompletion of Japanese sentences by inferring function words from content words
A method of generating a Japanese sentence by inferring funct ion words from content words using valency pa~terns is presented. A procedure for selecting an appropriate function word, on the assumption that correct content words have been selected for a given phrase lattice, is described. A method ol ~ inferr ing a correct verb when verbs are recognized less accurately than nouns by the speech ...
متن کاملSpeech Recognition System of Arabic Digits based on A Telephony Arabic Corpus
Automatic recognition of spoken digits is one of the difficult tasks in the field of computer speech recognition. Spoken digits recognition process is required in many applications such as speech based telephone dialing, airline reservation, automatic directory to retrieve or send information, etc. These applications take numbers and alphabets as input. Arabic language is a Semitic language tha...
متن کاملArabic speaker-independent continuous automatic speech recognition based on a phonetically rich and balanced speech corpus
This paper describes and proposes an efficient and effective framework for the design and development of a speaker-independent continuous automatic Arabic speech recognition system based on a phonetically rich and balanced speech corpus. The speech corpus contains a total of 415 sentences recorded by 40 (20 male and 20 female) Arabic native speakers from 11 different Arab countries representing...
متن کامل